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Abstract 

We consider the general problem of finding the minimum weight b-matching on arbitrary graphs. We 
prove that, whenever the linear programming (LP) relaxation of the problem has no fractional solutions, 
then the belief propagation (BP) algorithm converges to the correct solution. This result is notable in 
several regards; (1) It is one of a very small number of proofs showing correctness of BP without any 
constraint on the graph structure. (2) Instead of showing that BP leads to a PTAS, we give a finite 
bound for the number of iterations after which BP has converged to the exact solution. (3) Variants of 
the proof work for both synchronous and asynchronous BP; to the best of our knowledge, it is the first 
proof of convergence and correctness of an asynchronous BP algorithm for a combinatorial optimization 
problem. (4) It works for both ordinary b-matchings and the more difficult case of perfect b-matchings. 
(5) Together with the recent work of Sanghavi, Malioutov and Wilskly [41] they are the first complete 
proofs showing that tightness of LP implies correctness of BP. 

1 Introduction 

Motivated by the cavity method in statistical physics, very fast distributed heuristic algorithms have recently 
been developed for the solution of random constraint satisfaction problems [55], [T^j, [T3|, [T]. Similar 
heuristic methods have been known for many years |20j in the context of coding theory. And a variety 
of specific examples of such algorithms have been developed in artificial intelligence, signal processing, 
and digital communications. Well-known examples include the Viterbi algorithm, the iterative decoding 
algorithm in turbo codes and in low-density parity-check codes |38| . Pearl's belief propagation algorithm for 
Bayesian networks [35|, the Kalman filter, and certain fast Fourier transform (FFT) algorithms. Very recent 
applications can also be found in systems biology [TH], [52], [57], computer vision [33], and data clustering 

In some cases, the algorithms generated by the cavity method are exactly of the form of a classic belief 
propagation (max-product or min-sum) i.e., a message-passing algorithm for efficiently computing marginal 
probabilities or finding the assignment with highest probability of a joint discrete probability distribution 
defined on a graph. The belief propagation (BP) algorithm converges to a correct solution if the associ- 
ated graph is a tree, and may be also a good heuristic for some graphs with cycles. In other cases, the 
cavity method may lead to a more involved survey propagation (SP) algorithm [28] , in which some form of 
correlation among variables is controlled. 

In this paper, we study the problem of finding the minimum weight b-matchings in arbitrary graphs via the 
min-sum version of BP algorithirQ. 

Our Results. Let G = {V,E) be an undirected graph with edge weights Wij for each edge {i,j} G E 
and node capacities bi for each node i ^ V. The iterative message-passing algorithm based on synchronous 
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BP for solving the weighted perfect b-matching probkm (see our Section ^ for the precise definition) is the 
foUowing simple procedure: At each time, every vertex of the graph sends (real valued) messages to each of 
its neighbors. The message transmitted at time t from i to j is Wij minus the bf^ minimum of the messages 
previously received by i at time t — 1 from all of its neighbors except j. At the end of each iteration, every 
vertex i selects bi of its adjacent edges that correspond to the bi smallest received messages. 

We will show the following result: For arbitrary graphs G, and all sets of weights {wij}, after 0{n) iterations, 
the set of selected edges converges to the correct solution, i.e., to the minimum weight perfect b-matching of 
G, provided that the LP relaxation of the problem (see Section [2] for definitions) has no fractional solutions. 
Additionally we introduce a new construction, a generalized computation tree, which allows us to analyze the 
more complicated case of BP with an asynchronous updating scheme, and prove convergence and correctness 
of it when each edge of the graph transmits at least 0{n) messages. To the best of our knowledge, this 
technique is new and can be applied in the analysis of asynchronous BP in other problems as well. These 
are extensions of the previous results of [S] and [53] which showed convergence and correctness of the above 
algorithm for bipartite graphs^ Moreover, our proof gives a better understanding of the often-noted but 
poorly understood connection between BP and LP through the dual of the LP relaxation. We also modify 
our BP algorithm and its analysis to include the problem of finding the non-perfect weighted b-matchings. 
Recently and independently from our work a similar result for the scenario of using synchronous BP for 
non-perfect 1-matchings was shown by Sanghavi, Malioutov and Wilskey [41]. 

Related Works. The weighted b-matching problem is an important problem in combinatorial optimiza- 
tion. It belongs to a family of integer linear programs which have been well-studied and can be solved in 
strongly polynomial time |13| . [14| . For extensive surveys see |23] and |36j . In physics, the study of the ran- 
dom 1-matching problem goes back to the work of Mczard and Parisi [27] who made a celebrated conjecture 
for the expected optimum weight (tt^/G) that was proven to be exact a decade later by Aldous [2|. 

BP algorithms have been the subject of extensive study in several communities. The general BP algorithm is 
known to be correct on graphs with no cycles [35| . For graphs with a single cycle, convergence and correctness 
of BP have also been rigorously analyzed [5], [S2|. For arbitrary graphs, relatively little is known about the 
correctness of BP, although some interesting progress has been made in [S3] , [3S| , [31] , [SH] • Performance of 
the BP algorithm usually depends on the length of cycles in graphs; most analytical results require that the 
graphs have no short cycles (i.e., that they are large-girth graphs) [33, [3], [21]. For the case of weighted 
matchings and a few other problems, there were initially surprising results that BP works correctly on graphs 
with many short cycles ([55]. [39]. [5], [29]. [34]). 

Recent works have also suggested a connection between the BP algorithm and linear programming (LP) in 
particular problems. A relationship between iterative decoding of channel codes and LP decoding was studied 
in [in] , [37] , [3S] • Other relationships were noted in the context of BP algorithms with convex free energies 
[ST] . [50] . [55] . and in the case of BP algorithms for resource allocations [3T]. For weighted 1-matchings, the 
connection was studied [5] in the context of similarities between BP equations and the primal-dual auction 
algorithm of Bertsekas [8] . And it was further clarified recently for non-perfect 1-matchings in [40] and [41] 
where it was shown that BP does not converge to the correct solution if the LP relaxation has fractional 
solutions. Another recent result studies this connection for the weighted independent set problem [42]. We 
will compare our work with some of these results in the "Technical Contribution" section below. 

Finally, we note that the BP equations for solving the weighted matching problem which we use in this paper 
have been previously studied in [6], [24]. These equations are also very similar to equations for weighted 
matching problems and traveling salesman problems given in [27] , [48] , [2] , [21] , and to equations for various 
other problems given in [SS] , [37] , [55] . 

Technical Contribution. The main contributions of our results and techniques can be summarized as 
follows: 

^Both of these results were assuming that the minimum weight matching is unique. Note that if there is more than one 
solution, then one can construct a fractional solution to the LP relaxation. 
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1. BP for the weighted matching was first used in [5] and its correctness and convergence was sliown for 
bipartite graplis witfi unique optimum solution. Tliat proof relied heavily on the fact that the minimum 
weight matching of a bipartite graph is locally optimal on any cycle since the cycles of a bipartite graph 
have even length. The same technique was used in to extend the result to b-matchings in bipartite 
graphs. But this technique fails for graphs containing cycles with odd length. In order to bypass this 
difficulty we use a completely different tool, complementary slackness conditions of the LP relaxation 
and its dual, which is independent of the graph structure. 

2. Connection between LP and BP has been suggested and analyzed by various groups (as we discussed 
above), but our result together with [JT], to the best of our knowledge, are the first ones which show 
both convergence and correctness of the BP algorithm when LP relaxation has no fractional solutions. 
One related result, [H], studies only properties of the BP fixed points and their relation to the LP, 
conditioned on the convergence of the BP algorithm. Similarly in another recent work, |55| . which 
generalizes methods of [50] and [51], the connection of the BP algorithm and LP relaxation is studied 
in the converged case of the BP. The authors also study interesting variations of the BP which have 
convex free energies. 

3. The asynchronous BP, which includes the synchronous version as a special case, has been a more 
popular version for practical purposes. But, due to its more complicated structure, it has not been the 
subject of much rigorous study. To the best of our knowledge, our work is the first correctness and 
convergence proof of asynchronous BP for a combinatorial optimization problem. Another advantage 
of our proof is the construction of a new tool, the generalized computation tree, which can be used 
for the analysis of the both convergence and correctness of asynchronous message-passing algorithms 
including BP. Without the notion of a suitable computation tree the existing methods, free energy 
analysis [58] [49] or Lipschitz functions [21] [1], do not give correctness and convergence at the same 
time. 

Organization of the Paper. The rest of the paper is organized as follows. In Section [21 we provide 
the setup, define the weighted b-matching problem, and describe the LP relaxation, the dual LP, and the 
complementary slackness conditions. In Section [3] we describe our algorithm for the minimum weighted 
perfect b-matching problem, and state our main result. The analysis of our algorithm is given in Section [5] 
The extension of our algorithm and results to the non-perfect minimum weighted b-matching problem are 
presented in Section [5] Finally, in Section [6] we state the asynchronous version of the BP algorithm and 
present its analysis. 

2 Definitions and Problem Statement 

Consider an undirected simple graph G = {V, E), with vertices V = {I, . . . ,n}, and edges E. Let each edge 
{i,j} have weight Wij G K. Denote the set of neighbors of each vertex i in G by N{i). Let b = . . . , 6„) 
be a sequence of positive integers such that bi < degQ{i). A subgraph AI of G is called a h-matching (perfect 
h-matching) if the degree of each vertex i in M is at most bi (equal to bi). Denote the set of b-matchings 
(perfect b-matchings) of G by MG(b) (PMG(b)), and assume that it is non-empty. Clearly PMG(b) C MG(b). 

The weight of a (perfect or non-perfect) b-matching denoted by is defined by Wm ~ Tliij l{i,i}eJ\/- 
In the next two sections, we will restrict ourselves to the case of perfect b-matchings. We will extend the 
analysis to (possibly non-perfect) b-matchings in Section [5] The minimum weight perfect b-Matching (b- 
MWPM), A'/*, is defined by M* = argminj,/gp]y[Q(b) Wm- The goal of this paper is to find M* via a 
min-sum belief propagation algorithm. Throughout the paper, we will assume that M* is unique. 

Linear Programming Relaxation. Assigning variables Xij S {0, 1} to the edges in E, wc can express 
the weighted perfect b-matching problem as the problem of finding a vector x G {0, l}'^' that minimizes the 
total weight X]ije_B ^ij^ij^ subject to the constraints X]jejv(i) •^y ~ ^« i . Relaxing the constraint 
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that Xij is integer, this leads to the following linear program and its dual: 



mm 
subject to 



< x^j < 1 



Primal LP 



max 
subject to 



J27=lbiy^-J2{ 



+ k] >Vi + Vj 



A„ > 



Dual LP 



(1) 



We say the LP relaxation ([T]) has no fractional solution if, every optimal solution x of LP satisfies x £ 
{0, Note that absence of fractional solutions implies uniqueness of integer solutions, since any convex 
combination of two integer solutions is a solution to the LP as well. We want to show that the BP algorithm 
for our problem converges to the correct solution, provided the LP relaxation ^ has no fractional solution. 



Complementary Slackness Conditions. Complementary slackness for the LP and its dual state that 
the variables x* — {x*^) and y* ~ (y*), A* ~ (A*,) are optimum solutions to the LP relaxation and its dual 
(P), respectively, if and only if for all edges {i,j} of G both x*j{wtj + A*, -y* -y*j) = 0. and (x*, - 1)A*, = 
hold. See [TU], [13] for more information about LP, dual LP and complementary slackness conditions. 

Using the fact that the LP has no fractional solution, one can deduce the following modified complementary 
slackness conditions: For all {i, j} £ M*; Wij + X*j = y* + y* and for aU {i,j} ^ M*; A*, = 0. 

By these conditions and the fact that A*, > 0, we have that Wij < y* + y* for all {i,j} G M* , and 
Wij > y* + y* for all ^ M*. However, as the counterexample given in Appendix [XI shows, it is in 

general not true that these inequalities are strict even when the LP has no fractional solution. Let 5* be 
the set of those edges in G for which \wij — y* ~ yj\ > 0. We will assume the minimum gap is e. i.e. 
e = min^j j}g5 \wij —y* — y*j \ > 0. Throughout this paper we assume that there exist an edge in G for which 
the strict inequality \wij — y* — yjl > holds and therefore e > is well defined. The other cases, where for 
each {i,j} G E the equality Wij = y* + y* holds, happens only for special cases and are discussed in Section 
14.51 Let also L = maxi<i<„ \y*\ . 



3 Algorithm and Main Result 

The following algorithm is a synchronous implementation of BP for finding the minimum weight perfect 
b-matching (b-MWPM). The main intuition behind this algorithm (and, indeed, all BP algorithms) is that 
each vertex of the graph assumes the graph has no cycles, and makes the best (greedy) decision based on 
this assumption. This is shown in more detail in Section [4. II 

Before applying the BP algorithm, we remove all trivial vertices from the graph. A vertex i is called trivial if 
degaii) = hi. This is because all of the edges adjacent to i should be in every perfect b-matching. Therefore 
the graph can be simplified by removal of all trivial vertices and their adjacent edges. 

Algorithm Sync-BP. 



(1) At times t = 0, 1, . . ., each vertex sends real-valued messages to each of its neighbors. The message of 
i to j at time t is denoted by mi^j (t) . 

(2) Messages are initializecil by mj_»j(0) = Wij for all {i,j} G E. 

^We show in Section 14.41 that the messages can be initialized by any arbitrary values. 
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(3) For t > 1, messages in iteration t are obtained from messages in iteration t — 1 recursively as follows: 



V{t,j}eE: 



1) 



(2) 



where fc*'* 



min[A] denotes the k 



minimum 



of set A. 



(4) The estimated b-MWPM at the end of iteration t is = U^^^^E^it) whcic E^{t) = {{«, ji}, •••,{«, jb J} 
is such that N{i) = {jij2, ■ ■ ■ Jdegaii)} and mj^^i{t) < mj^^i{t) ■ ■ < mj^^^^^^^-,i{t). i.e., among all 
«'s neighbors, choose edges to the bi neighbors that transfer the smallest incoming messages to i. 

(5) Repeat (3)-(4) until M{t) converge^ 



In Corollary [U we will show the main intuition behind the equation ^ and how it is derived. But we 
note that one can also use the graphical model representations of [5], [24], [40] to obtain the standard BP 
equations for this problem, which, after some algebraic calculations, yield the recursive equation ([2]). 

The main result of the paper is rather surprising: it says that the above algorithm, which is designed for 
graphs with no cycle (i.e., for trees), works correctly for a much larger family of graphs including those with 
many short cycles. 

Theorem 1 Assume that the LP relaxation (Qp has no fractional solution. Then the algorithm Sync-BP 
converges to M* after at most \^^~\ iterations. 

If the LP relaxation ^ has a fractional solution whose cost is strictly less than Wm*, then [ID], [H] have 
shown for the case of 1-matching that BP does not converge to M*. It is straightforward to generalize this 
to perfect b-matcliing as well. But for the case in which the LP relaxation has a fractional solution whose 
cost is equal to Wm* , BP fails in general. This is because the bf^ minimum in equation ^ is not unique, 
and one needs an oracle to make the right decision. If such an oracle exists, then BP converges to M*. 

In Section[51 we will give and analyze the analogous min-sum algorithms for finding the (possibly non-perfect) 
minimum weight b-matching and in SectionlHl wc will give and analyze the asynchronous version of Sync-BP. 

4 Analysis of the Synchronous BP 

This section contains the analysis of the synchronous BP algorithm for perfect b-matchings. First, in Section 
14.11 we show one derivation of the equations for Sync-BP and its representation in term of the so-called 
computation tree. Next, in Section 14.21 we introduce a technical lemma which connects the complementary 
slackness conditions of Section [2] with alternating paths in the graph G. This lemma is used in Section 14.31 
to prove that, when the LP relaxation has no fractional solutions, then solutions on the computation tree 
are the same as the solutions on the original graph G. 

4.1 Computation Tree and Derivation of Sync-BP 

The main idea behind the algorithm Sync-BP is that it assumes the graph G has no cycle. In other words, 
it finds the b-MWPM of a graph G" that has the same local structure as G but no cycles. In this section we 
rigorously define such graph G" (computation tree) and show its connection with the Sync-BP algorithm. 

*Note that the 6'''-min£gjv{i)\{j} is well defined since we assumed that all trivial vertices are removed and thus there arc at 
least 6i -I- 1 elements in the set N{i) for each i. 

^ The subgraph M{t) is not necessarily a perfect b-matching of G but we will show that after 0{n) iterations it will be the 
minimum weight perfect b-matching. 
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Figure 1: Part (a) shows a graph G where dashed and gray edges represent a 1-matching. Part (b) shows 
the computation tree Tg corresponding to G where the set of dashed and gray edges form a 1-TMWPM. 

Computation Tree. For any i ^ V, let T/ be the t-levcl computation tree corresponding to i, defined as 
follows: r/ is a weighted tree of height t + 1, rooted at i. All tree-nodes have labels from the set {1, . . . , n} 
according to the following recursive rules: 

(a) The root has label i. 

(b) The set of labels of the degdi) children of the root is equal to N{i). 

(c) If s is a non-leaf node whose parent has label r, then the set of labels of its children is iV(s)\{r}. 

Note 1. r/ is often called the unwrapped tree at node i. The computation tree is constructed by replicating 
the local connectivity of the original graph. The messages received by node i in the belief propagation 
algorithm after t iterations in graph G are equivalent to those that would have been received by the root i in 
the computation tree, if the messages were passed up along the tree from the leaves to the root. Computation 
trees have been used in most of the previous analyses of BP algorithms; see e.g. [201 13 Ell EH |53l [T^ . 

A subtree M. of edges in the computation tree T!* is called a perfect tree-h -matching if for each non-leaf 
vertex with label i we have degM(i) ~ bi. Now denote the minimum weight perfect tree-b-matching (b- 
TMWPM) of the computation tree T- by M*{Tl). We will show that Sync-BP can be seen as a dynamic 
programming procedure that finds the minimum weight perfect tree-b-matching over the computation tree. 
Figure [T] shows a graph G and one of its corresponding computation tree. 

Sync-BP Equations. Consider the computation tree T/. Let us assume that degaii) = k, and that 
ii,. . . ,ik are neighbors of i in G which are children of the root i as well. Let us denote the subtree of T/ 
that consists of the root edge and all descendants of ij by T/.^^. Given this, we define the following 

weights and weight differences: 



Clearly, for any edge of graph G the real number nj^i{t) is well-defined; the next lemma shows its 

relation with the messages passed in Sync-BP. 

Lemma 1 For all I < i, j < n such that is an edge of G and all t = 0,1, . . the following is true: 

nj^i{t) = mj^i{t). 

Proof We proceed by induction on t. For t ~ by definition the computation tree T!? has height 1. 
Therefore each branch consists of a sing le root edge {i,ij). Thus W+_,(0) w^^^ and W;:_i{Q) = 

which gives: ni^^i^O) = wu- , and by definition this is equal to mi.^i{Qi). Now for the general case consider 
the computation tree T/ and fix a branch Tl._^^. Denote the children of ij in this branch by ji, . . . ,ji with 



ni.-,i{t) 



Weight of b-TMWPM in T* that contains the root edge {i,ij). 
Weight of b-TMWPM in T/^^j that does not contain the root edge (i, ij). 
W+^,{t)-W,r:_^,it). 
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so that 



£ = degciij) ~ 1 (by rule (c) from the construction of the computation tree described above). For simpHcity 
of notation let a = bi.. Without loss of generality assume that the children ji, . . . , ji are ordered so that 

w+^, {t-l)< W+^, {t-l)<--- < W+^, it - 1). 

Now it is not hard to sec that 

a-l I 
r—1 r—a 

r—1 r— a+1 

= Wii. - rij^^i. {t - 1) 

= wu. - a*''-minrg7v(i,)\{i} (^"j.^i^ " 1)^ • 

Therefore we have shown that variables nj^i{t) satisfy the same recursive relation as variables mj^i(t), 
equation ([2|), and satisfy the same initial conditions. Thus they are equal. I 

It follows immediately from the above lemma that the set of edges Ei {t) which is selected in iteration t of 
the algorithm Sync-BP consists of exactly the same edges which are adjacent to root i in M*{Tl). This is 
formalized in the following corollary. 



Corollary 1 The algorithm Sync-BP solves the h-TMWPM problem on the computation tree. In particular, 
for each vertex i of G, the set of Ei(t) which was chosen at the end of iteration t by Sync-BP is exactly the 
set of bi edges which are attached to the root in h-TMWPM ofTf. 

Corollary [1] characterizes the estimated b-MWPM, M{t), and will be used in the proof of the main result in 
Subsection 14.31 In the next subsection we present a lemma that is crucial for the proofs of Subsection 14.31 



4.2 Main Technical Lemma 

In this section we state our main technical lemma which connects the complementary slackness conditions 
from Section [2] to paths on the graph G and on the computation tree. This lemma is a key step in our proof. 
Its proof is quite delicate, and provides the connection between the absence of fractional solutions and the 
correctness of BP. 

Definition A path P — (ii, i2, . . . , ik) in G is called alternating path if: 

(a) There exist a partition of edges of P into two sets A, B such that either {A C M* , B n M* = 0) or 
(AnM* = , B C M *). Moreover A {B) consists of all odd (even) edges; i.e. A = {(«i, 12), (is, m), ■ . ■} 
{B = {(12, «3), {u,ib), ■ ■ ■})■ 

(b) The path P might intersect itself or even repeat its own edges but no edge is repeated immediately. 
That is, for any I < r < k — 2 : ir ^ ir+i and i,, =/= ir+2- 

P is called an alternating cycle if ii = ik- 
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Lemma 2 Assume that the LP relaxation (1^ has no fractional solution. Then for any alternating path P 
of length at least 2n, there exists an edge {i,j} E P such that the inequality \wij — y* ~ J/j | > holds. That 

Proof We will consider two cases: 

Case I) Existence of an even simple cycle in P. 

Consider the subgraph of G that is generated by edges and vertices of P. If this subgraph contains an 
alternating cycle C that does not intersect itself (simple cycle) and has even length, then we will show that 
(705^0. Let C = {ji, . . . ,j2e,ji)- Without loss of generality assume that odd edges belong to M* and 
even edges do not. That is, for all 1 < r < ^ : 

{«2r-l,i2r} G M* , {«2r , j2r+l } ^ M* 

where j2e+i = ji- To prove C C\ S ^ ^, assume the contrary; that is, assume for all edges [i,]} of C : Wij = 
y* + y*. The weight of M*-edges of C is equal to weight of their complement in C, due to the fact that 

e 21 I 

r—1 s=l r=l 

Now one can obtain a perfect b-matching M' in G which is different from M* and has the same weight as 
M*. This can be done by defining M' = M* outside cycle C, and M' = C\M* on cycle G. However, this 
contradicts the uniqueness assumption for b-MWPM in G which holds due to the fact that the LP relaxation 
has no fractional solution. Hence we are done. 

Case II) There is no even simple cycle in P. 

Let P = {ii,i2, ■ ■ ■ ,ik}- Since P has length at least 2n, it must repeat a vertex. We also add a natural 
direction to each edge {ij, ij+i} that is from ij to ij+i. Consider the first vertex that is revisited by starting 
from ii and walking along P. That is, consider the smallest numbers r, s such that l<r<s<n + l and 
ir = is- Now we break P into three connected pieces as follows: 

(i) Simple path Pq ~ (ii, i2, . . . , ir) (this part will be ignored). 

(ii) Simple cycle C\ ~ {ir, V+i, ■ • ■ , is)- 

(iii) Path Pi = {is+i,is+i, . . . ,ik)- 

From now on we are going to assume that path Pq does not even exist. Basically we will show that there is 
one edge from S which is in Ci U Pi. Since we assumed that P has no even simple cycle, it follows that Ci 
has odd length (s — r is odd). Since the length of P is at least 2n, it follows that P has to intersect itself at 
least twice and there must be another vertex that is revisited after v. Consider the smallest numbers r',s' 
such that r < r' < s' < k and ir' ~ is'- Denote this new simple cycle by C2; i.e., C2 = (vs V'+i, ■ • ■ j*s')- 
Again since G2 is an alternating path, it has to have odd length (s' — r' is odd). 

Now wc claim that s < r'. Again assume the contrary, that r < r' < s. We obtain a contradiction 
by finding an even simple cycle in P. Break path Ci in two simple paths Qi = (v, V+i, ■ • ■ , V') and 
Q2 ~ (V'j*r'+i, ■ • ■ ,is)i and define the simple path = {is,is+i, - - - ,*s')- Now consider the simple cycle 
^3 = Qi U Q3. The length of C3 is equal to r' — r + s' — s, which has the same parity a,s s — r + s' — r\ 
which is even. Therefore C3 is an even cycle. Moreover, the fact that the parities of r' and s' are different 
guarantees the alternation of adjacent edges {v'-i, V'} and {is'-i,is'} in cycle C3. Similarly the difference 
in parity between r and s implies alternation of adjacent edges {v,ir+i} and {is,is+i} in cycle C3. Thus 
C3 is an even length alternating simple cycle, which is a contradiction. So the claim s < r' is proved. 

Now we are left with a final possibility which uses the integrality of the LP optimum solution. Consider the 
following three pieces of path P: 

(i) Simple odd cycle Ci. 
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(ii) Simple path P2 ~ (is+i, is+i, ■ • ■ , V') (could be only a point). 

(iii) Simple odd cycle C2. 



If (Ci UP2UC2)nS' = 0, this means that for all edges {i, j} G Ci UP2UC2, the equality = Vi+Vj holds. 
We will reach a contradiction by showing the existence of an optimum fractional solution for LP relaxation 
([T|). This is done by defining x' as follows: 

r if{i,i}^CiUP2UC2 

[ 0.5 if G Ci nC2. 

First we need to show that x' is a feasible solutions for the LP. For this, all we need to show is that x' 
satisfies the same local constraints as x* on vertices of Ci U P2 U C2. Since all Ci U P2 U C2 is a connected 
alternating path, then for all vertices G Ci U P2 U C2 {i ^ {r, s, r' , s'}) it is clear that x'^g_-^^f, + a^^^^^i) = 
x*t^i-iy + ^£(^+1) = 1- For £ = r, using the fact that length of Ci is odd and path Ci U P2 is an alternating 
sub-path of P, either x*^^^^^ = x*^^^^^ = 1, = or x*f^^^-^^-^ = ^{s-i)s = 0' K{s+i) = 1' which leads 

to K{r+i) = ^[s-i)s = 0-5> = 1 or a;^(^^-^j = x'^^_^^^ = 0.5, = 0, respectively. In both cases, 

x' satisfies same local constraint as x* at ir. A similar argument holds at V'. 

Next we show that x' has the same cost as x* . This is done by applying the equality wtj = y* + y* to all 
edges of Ci U P2 U C2 as follows: 



{ij}eCiuP2UC2 ieCiUP2UC2 



ieCiuC2 {«j}eP2 



{i,j}eCiuC2UP2 



This completes the proof of Lemma [5] 



4.3 Proof of Theorem [T] 

We will prove Theorem [1] namely that if the LP relaxation ^ has no fractional solution and hence M* is 
unique, then Sync-BP converges to the correct b-MWPM. We will do this by showing that if the depth of 
computation tree is large enough, then for any vertex i, its neighbors in AI* (b-MWPM of G) are exactly 
those children that are selected in N*{Tl) (b-TMWPM of T/). Here is the main lemma that summarizes 
the above claim: 

Lemma 3 If the LP relaxation ([ip has no fractional solution, then for any vertex i of G and for any t > 

the set of edges that are adjacent to root i in J\f*(Tl) are exactly those edges that are connected to i in M* . 

The proof of Lemma [3] is the main technical part of this paper. Before entering into the details of the proof 
here is a high level overview of the underlying argument. Consider the computation tree (T/) rooted at vertex 
i and look at M*{Tl). We will assume that the claim of the lemma does not hold. That is, we assume that 
at the root, M*{Tl) does not choose the same edges as M*-edges adjacent to i. Then we use the property 
of perfect tree-b-matchings, namely that each non-leaf vertex j is connected to exactly bj of its neighbors, 
to construct a new perfect tree-b-matching on the computation tree. This new perfect tree-b-matching is 
going to have less total weight if the depth of the computation tree is large enough. This last step uses an 
alternating path argument which is a highly non-trivial generalization of the technique of [S] for the case of 
perfect 1-matching in bipartite graphs. For this part we will use the solutions to the dual LP ([T|). 
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Proof of Lemma [3] Let us denote the lifting of a perfect b-matching M* to a perfect tree-b-matching on 
T/ by A^*. That is, M.* consists of ah edge of the computation tree with endpoint labels i, j such that 
G M* as an edge in G. The goal is to show that J\f*{Tf) and A4* have the same set of edges at the 
root of the computation tree. To lighten the notation, we denote the b-TMWPM of Tf by J\f* . 

Assume the contrary, that there exist children of root i such that € Ai*\Af* and G 

J\f*\M*. Since both M* , M* are perfect tree-b-matchings, they have bi^ edges connected to ii. Therefore 
there exist a child 12 of ii such that {11,12} G M*\M.* . Similarly there is a child z_2 of i-i such that 
{i_i,i_2} G AA*\N'* ■ Therefore we can construct a set of alternating paths Pi, £ > 0, in the computation 
tree, that contain edges from A4* and M* alternatively defined as follows. Let zq = root i and Pq = (*o) be 
a single vertex path. Let Pi = (z_i,io,Ji), P2 = (j-2, J-i, *0) *i) ^2) and similarly for r > 1, define Pjr+i and 
P2r+2 recursively as follows: 

P2r+1 = (?'-(2r+l)7 -F2r, «2r+l) , P2r+2 = («-(2r+2) i ^2r+l , «2r+2) 

where i_(2r+i), i2r+i are nodes at level 2r + 1 such that {i2r,i2r+i} G M*\N'* and {i-2r, «-(2r+i)} G 
M*\M* . Similarly i_(2,+2)i «2r-+2 are nodes at level 2r + 2 such that {?2r+ii *2r+2} G A/'*\A^* and 
{i_(2r+i), *-(2r+2)} G A^*\A/'*. Notc that, by definition, such paths P^ for < ^ < i exist since the 
tree T/ has t+ \ levels and can support a path of length at most 2t as defined above. Now consider the path 
Pt of length 2t. It is an alternating path on the computation tree with edges from M* and A/"*. Let us refer 
to the edges of M* (TV*) as the 7W*-edges (A/'*-edges) of Pt. 

We will now modify the perfect tree- b-matching M* by replacing all A/"* -edges of Pt with their complement 
in Pt (A^*-edges of Pt). It is straightforward that this process produces a new perfect tree-b-matching M' 

Let us assume, for the moment, the following lemma: 

Lemma 4 The weight of the perfect tree-h - matching M' is strictly less than that of M* on T*. 

This completes the proof of Lemma [3] since Lemma |3] shows that M* is not the minimum weight perfect 
tree-b-matching on T/, leading to a contradiction. I 

Now, we provide the proof of Lemma [4l 

Proof of Lemma [4] It suffices to show that the total weight of the A/'*-edges of Pt is more than the total 
weight of A^*-edges of Pj. For each vertex v G Pt consider the value y* from the optimum solution to the 
dual LP III). Using the incquahty Wij < y* + y* for edges of M* , we obtain: 

{i,j}ePtnM' \r=-t / 

where ki is the number of Al*-edges of Pt that belong to S, i.e., the number of A^*-edges of Pt endowed 
with the strict inequality Wij < y* + y* , with a gap of at least e. On the other hand, using the inequality 
Wij > y* + y* for edges of M* we have: 

{i,i}e-PtnA^* / 

where now ^2 is number of A/'*-edges of Pt that belong to S*, or equivalcntly the number of times the inequality 
Wij > yl + y*j is strict with a gap of at least e. One finds 

(a) {h) (c) 

> (fci -I- fc2)e - 2L > (fci + fc2)e - 2L > (5) 
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where (o) uses definition of L from Section [5] and (&) uses the fact that for all i, j : A*, > 0. The main step 
is (c), which uses Lemma [2] as follows. Path Pt has length 2t, and each continuous piece of it with length 
2n has a projection to the graph G which satisfies the conditions of Lemma [21 This means the path has at 
least one edge from the set S. Thus [ki + fc2) > |^ > This completes the proof of Lemma ■ 

4.4 Independence from Initial Conditions. 

We would like to point out that changing the initial condition for the messages in step (2) of Sync-BP to any 
arbitrary values does not change the convergence and correctness of algorithm Sync-BP. The only effect of 
initial condition is on the number of iterations needed for convergence. Theorem[T]remains true by re-defining 
L according to: L = maxi<i<„ \y* \ + max^j ^jg^; |mi^j(0)|. This follows because, by changing the initial 
condition, the algorithm Sync-BP runs over a slightly modified computation tree. The new computation 
tree is almost the same computation tree as T/, except that the leaf edges of the tree have arbitrary weights 
and not w^j's from G. In the proof of Lemma O the only place where the weight of leaf edges appears is the 
inequality (a) in equation (O, which will be satisfied by new definition of L. 

4.5 Sync-BP is Correct When e is Not Weil-Defined 

Recall from the discussion in Section [2] that, if for all edges G E the equality Wij = y* +y* holds, then 

e is not well defined. In this section we show that these rare cases do not cause any trouble. We will show 
that the condition t > in the main theorem can be replaced by t > n. This is shown by proving the 
following lemma instead of Lemma [3l 

Lemma 5 // the LP relaxation {Ip has no fractional solution, then, for any vertex i of G and for any t > n, 
the set of edges that are adjacent to root i in J\f*(Tl) are exactly those edges that are connected to i in M* . 

Proof The proof is similar to the proof of Lemma [31 If after iteration t, the claim of the Lemma [5| does 
not hold, then the alternating path Pt can be constructed as before. Now since the length of Pt is greater 
than 2n, one can use the technical Lemma [2| for the projection of the path Pt onto G to show that the strict 
inequality \wij — y* — yj \ > happens for at least one edge. This contradicts the above assumption at the 
beginning of the Section. Therefore Lemma [5| is true. I 

5 Extension to Possibly Non-Perfect b-Matchings 

In this section we show that the algorithm and the results of the previous sections can be easily generalized 
to the case of b-matchings (subgraphs H oi G such that degree of each vertex i in H is at most hi). Let 
U {H) C y be the set of unsaturated vertices of G (vertices i € V such that degnii) < bi). Similar to Section 
[21 the minimum weight b-Matching (b-MWM), H* , is the b-Matching such that 

H* = argmax^gMG(b) Wh- 

Note that H* does not include any edge with positive weight because removing such edges from H* reduces 
its weight while keeping it a b-matching. Therefore in this section we assume that for all {«, j} G : Wij < 0. 
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The LP relaxation is slightly different from before: 



max YTi=i -biVt - T,{i^j}eE -^y 
subject to 

Wij + A.y > -yj - yj V {i,j} G E 

A.y>0 W{i,j}eE 

Dual LP 

(6) 

Complementary slackness now reads, for all {i, j} £ E: x*j(wij + X*j + y* + y*) — 0, {x*j — l)A*j = and 
for aU ^ G V: {Y.jeN(i) " bi)v* = 0. 

Similarly to Section [2l we can write the following modified complementary slackness condition using the fact 
that the LP relaxation has no fractional solution: 

(CS'-i) For all j} G H*- w,, + X*^ + y* + y* = 0. 
(CS'-u) For aU {i,j} ^ H*- A*, = 0. 
(CS'-iii) For all i G U{H*)- y* = 0. 

Let S' be set of those edges in G for which \wij ^y* + y*j \ > 0. We will assume the minimum gap is e'. That 
is 

< e' = min |w,, + y* +y*\- 

The quantity L' is defined similarly to L by i' = maxi<i<„ |y*|. 

Now we can present the modified algorithm Sync-BP for finding b-MWM in G: 



subject to 

< < 1 \f{i,j}eE 
Primal LP 



Algorithm Sync-BP(2). 



(1) At times t = 0, 1, . . ., each vertex sends real-valued messages to each of its neighbors. The message of 
i to j at time t is denoted by rrii^j (t) . 

(2) Messages are initialized by mi^j(O) = Wy for all G E. 

(3) For t > 1, messages in iteration t are obtained from messages in iteration t — 1 recursively as follows: 



V {i, j} G E : mi-^j{t) = Wij - min ^0, 6*''-minfgjv(i)\{j} mf^i{t ^ 1) ^ 



(7) 



where fc -min(A) denotes the fc^ minimurrO of set A. 



(4) The estimated b-MWM at the end of iteration t is = U7^iFi(t) where = {{«, ji}, • ■ • , {«, jcj} 
is such that mji,^i{t) < for all 1 < £ < Ci, i.e., choose edges that transfer negative messages to i. 

(5) Repeat (3)-(4) until H{t) converges. 



The results for b-matchings generalize as follows: 
^Here 6^''-min^gjv(i)\{j} is defined to be if dega^i) = hi. 
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Theorem 2 Assume that the LP relaxation has no fractional solution. Then the algorithm Sync-BP(2) 
converges to H* after at most [ ^"^ ] iterations. 

The proof of Theorem [2] is shiiilar to the one of Section |4] with the following modifications: 

1. The computation tree T/ and b-TMWM arc defined as before, while Lemma [T] is slightly modified. A 
careful analysis of W'^ and W~ for the tree-b-matchings yields equations ([7]) for finding b-TMWM in 
the computation tree. This is how the new equations are obtained. 

2. The technical lemma from Section [4.21 is still true and its proof does not change because the definition 
of alternating paths is preserved and because all cycles involved in the proof turn out to be adjacent 
to exactly one edge of H*. 

3. The proof of Lemmas [3] and [4] should be slightly modified. In particular, the alternating path Pt can 
be different: One can show that if the b-TMWM J\f*{Tl) and the tree-b-matching H* choose different 
sets of edges at the root i, then an alternating path can be constructed as before in T/ which includes 
the root i. But endpoints of this alternating path Pt are either leaves of T/ or vertices inside T* which 
have labels from U{H*) (are un-saturated in G by H*). In the case in which there is at least one leaf 
as an endpoint of Pt, the same argument as equation ([5]) in Section l43l can be used since length of Pt is 
at least t. This shows (fci -I- ^2) > ^ > ^h- But in the case in which both endpoints of Pt are non-leaf 
vertices of the computation tree, then using condition (CS'-iii), the analogous version of equation ^ 
is as follows: 

~ = + ^2)e- (8) 

{jj'lePtnA"* {L]}ePtnH* 

Now all that is needed is to show ki+k^ > 0. We will show this by the following extension of technical 
Lemma [5] 

Lemma 6 Assume that the LP relaxation (0) has no fractional solution. Then for any alternating 
path P with endpoints from the set U{H*), there exists an edge {i,j} G P such that the inequality 
|wy + y* +y*\ > holds. That is, Pr\S' 

Proof For paths P with length at least 2n, we can use Lemma [31 so there is nothing to do. If a 
subgraph generated by P includes at least two cycles, then the same argument as in the proof of 
Lemma [5] can be used. Therefore we can assume P intersects itself at most once. So P can be written 
as a union C U Pi where C is an odd simple alternating cycle and Pi is a simple alternating path 
(either C or Pi can be empty, but not at the same time). Next, one can define a different solution x' 
to the LP ^ which has the same cost as x* by defining x' = 1 — x* on path Pi and setting x' equal 
to 0.5 on C. x' will still be a feasible solution since the endpoints of Pi are elements of U{H*) and the 
edge adjacent to them in path Pi is not in H* . This contradicts the no fractional solution assumption 
on the LP. ■ 



6 Analysis of the Asynchronous BP 

In this section we study the asynchronous version of the BP algorithm. The update equations are exactly 
analogous to the synchronous version, but at each time only a subset of the edges are updated in an arbitrary 
order. Consider the set E of all directed edges in the G; i.e., E ~ {{i ^ j) s.t. i ^ j ^ V}. Let A be 
a sequence E{1), E{2), ... of subsets of the set E. Then the asynchronous BP algorithm corresponding to 
the sequence A can be obtained by modifying only the step (3) of the algorithm Sync-BP for the perfect 
b-matchings: 
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(3) For t > 1, messages in iteration t are obtained from messages in iteration t — 1 recursively as follows: 



lit) 



mi^i(t - 1) 



if {t -> j) e E{t) 



j{t — 1) otherwise 



Note 2. This is the most general form of the asynchronous BP and it includes the synchronous version 
{E{t) — E for all i = 1, 2, . . .) as a special case. In many applications, a special case of the asynchronous BP 
is used for which each set E{t) consists of a single element. 

We assume that the sequence A of the updates does not have redundancies. That is, no edge direction 
[i ^ j) & E is re-updated before at least one of its incoming edge directions {{£ — > i) for £ G is 
updated. More formally, if (i j) G E{t) n E{t + s) and (i j) ^ L}'^j.z\E{t + r), then at least for one 
£ e N{i)\{j}, we should have {£ ^ i) e U'^zlE{t + r). 

Let us denote the above algorithm by Async-BP. We claim that, if each edge direction (i ~* j) E E is updated 
9{n) times, then the same result as Theorem[T]can be proved here. That is, let u{t) be the minimum number 
of times that an edge direction of the graph G appears in the sequence E{1), . . . , E{t); i.e., 

u{t) = min ( {£ : s.t. l<£<t and (i ^ j) e E{£)} ). 

From the definition, u(t) is a non-decreasing function of t. We claim that the following result holds: 

Theorem 3 Assume that the LP relaxation (Qp has no fractional solution. Then the algorithm Async-BP 
converges to M* after at most t iterations, provided u(t) > 2Hii_ 

Before proving the above theorem let us define the notion of generalized computation tree for the asyn- 
chronous version of the BP algorithm. 



6.1 Generalized Computation Tree for the Asynchronous BP 

In order to define the generalized computation tree (GCT) for the asynchronous BP, we will begin with some 
definitions. For any (i — s- j) G E{t), define Rl^j to be the computation branch of i to j at time t which 
is a weighted rooted tree (not necessarily a balanced rooted tree) and recursively defined according to the 
following rules: 

(a) The root has label j. 

(b) The root has only one child which has label i. 

(c) If t = 0, then the child i has no child {R^^j is .just a single edge {z, j}). 

(d) For t > 0, if (i ^ j) ^ E{t) then = R\^j- Otherwise the child i has degaii) — 1 children which 
have all of labels in the set N{i)\{i} and for any child r of i the subtree that consists of all descendants of 
r and the edge {r, i} to i is RlZ^i- 

The edge between nodes labeled i,j in the tree is assigned weight Wij for 1 < i < j < n. Now for any vertex 
i G V and any t, the GCT i?* is a weighted rooted tree with root i such that all its branches starting from 
the root are the computation branches for all r € ^{i). Since the GCT i?* is not necessarily balanced, 

we will define its depth to be the length of the shortest path from the root i to a leaf and denote it by d{R\). 

Similarly to the Section 14. H we can define the minimum weight perfect trec-b-matching (b-TMWPM) for 
GCT R\ and denote it by Ai*{R\). Moreover, arguments similar to the ones in the Section B?T] show that the 
algorithm Async-BP is solving the b-TMWPM for GCTs i?*. In other words, the following corollary holds: 

Corollary 2 The algorithm Async-BP solves the h-TMWPM problem on the GCT. In particular, for each 
vertex i of the G, the set Ei(t) that was chosen at the end of iteration t by Async-BP is exactly the set of bi 
edges that are attached to the root in b- TMWPM of Rj . 
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6.2 Technical Analysis of the Asynchronous BP 

Now we can use the same analysis as in Section [4.3l to show that if the depth of the generaUzed computation 
tree (GCT) is large enough, then for any vertex i, its neighbors in M* (b-MWPM of G) are exactly those 
children that are selected in 7V*(i?-) (b-TMWPM of i?*). We wiU show this by relating the function u{t) to 
the depth of the GCT Here is the main lemma which summarizes the above claim: 

Lemma 7 // the LP relaxation has no fractional solution, then for any vertex i of G and for any t 
such that u{t) > the set of edges that are adjacent to root i in J\f*[R\) are exactly those edges that are 
connected to i in M* . 

The proof of Lemma [7] is similar to the proof of Lemma [31 with the following slight modifications: 

(i) One can construct alternating paths Pi in the same way as before for 1 < ^ < d{R\). 

(ii) The depth of the GCT R\ is related to u{t) according to the following lemma: 

Lemma 8 For any vertex i (z V and any t, the depth of any computation branch at time t is at least u{t); 
I.e., > uit). 

This tells us d{Rl) > u{t) > Applying this to the path Pd{B*)^ analogously to the use of Lemma [4] in 

the proof of Lemma [3l gives us the proof of Lemma [T] Therefore all that is needed is a proof of Lemma [51 

Proof of Lemma [HI The proof follows easily by looking at the construction of the computation branch. 
Each computation branch R\^j grows at time t if (i j) G E{t). And if this is the case, the depth increases 
by at least one due to the "no redundancy condition" on the updating sequence. So if each edge is updated 
at least u{t) times then the depth of its computation branch grows by at least u{t). I 

Finally we note that the same algorithm as Async-BP and the same result as Theorem [31 can be stated and 
proved for the (possibly non-perfect) b-matchings as well. 
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Appendix 



A Counterexample 

To see that strictness of inequality Wij > y*+y* from Scction[2]does not hold in general, consider the case of 1- 
matching on the complete graph with four vertices and weights wi2 = W2a ~ 1^34 = a , wi^ = wi/^ = W23 = b 
where b ^ a. It is clear that AI* = {(1,2), (3,4)} is the unique minimum weight matching. Consider the 
following, feasible solutions to the LP and its dual: x'12 = 2:34 = 1 , 2^13 = x'l^ = 0:23 = X24^ = 0, X' — 0, and 
y' = {a/2, a/2, a/2, a/2). These solution satisfy the complementary slackness conditions. Therefore x* = x' 
and A* = A', y* = y'. But for the edge (2,4) ^ M* the equality W24: = 1/2+^4 holds. The same example can 
be used for the case of (non-perfect) matching as well. 
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